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We set out to gain deeper insight Into the potential of antibody light chain variable domains (V^^s) as Immunothera- 
peutics. To this end, we generated a naive human V,^ phage display library and, by using a method previously shown 
to select for non-aggregating antibody heavy chain variable domains (VHs), we Isolated a diversity of V,^ domains by 
panning the library against B cell super-antigen protein L. Eight domains representing different germline origins were 
shown to be non-aggregating at concentrations as high as 450 jjlM, Indicating VL repertoires are a rich source of non- 
aggregating domains. In addition, the V^^s demonstrated high expression yields in £. coli, protein L binding and high 
reversibility of thermal unfolding. A side-by-side comparison with a set of non-aggregating human V^s revealed that 
the V|^s had similar overall profiles with respect to melting temperature (TJ, reversibility of thermal unfolding and resis- 
tance to gastrointestinal proteases. Successful engineering of a non-canonical disulfide linkage in the core of V^^s did not 
compromise the non-aggregation state or protein L binding properties. Furthermore, the introduced disulfide bond 
significantly increased their T^s, by 5.5-17.5 °C, and pepsin resistance, although It somewhat reduced expression yields 
and subtly changed the structure of V^^s. Human V^^s and engineered versions may make suitable therapeutics due to their 
desirable biophysical features. The disulfide linkage-engineered V^^s may be the preferred therapeutic format because of 
their higher stability, especially for oral therapy applications that necessitate high resistance to the stomach's acidic pH 
and pepsin. 



Introduction 

As antibody-based therapeutics, full-length monoclonal 
antibodies have little competition so far.''^ In fact, most approved 
monoclonal antibodies and those in regulatory review are 
canonical IgG antibodies (www.landesbioscience.com/journals/ 
mabs/about/) . The disadvantages of these molecules, such as costly 
and time-consuming production in mammalian cell lines, large 



(-150 kDa) and complex molecular structures, poor tissue 
penetration and inability to access cryptic epitopes, and the 
fact that the Fc portion of the antibody is not needed in many 
instances or may even be harmful, have resulted in the creation 
of a niche that can be occupied by antibody fragments. These 
smaller antibody fragments, including single-domain antibodies 
(sdAbs), have unique features that may make them the preferred 
therapeutic format for many applications. Currently, there are 
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numerous antibody fragments in clinical development, with 
some being sdAbs.^'* 

sdAbs, e.g., human V^jS, human Vj^s, camelid V^Hs, have 
become a viable option in the antibody-based therapeutic tool 
box that also includes IgGs, antigen-binding fragments (Fabs), 
single chain Fv fragments (scFvs), and their many derivatives. 
Appealing features of sdAbs include their high affinity (nM - 
pM equilibrium dissociation constant (K^) range) for cognate 
antigens, small size (-15 kDa) and simple structure, single 
domain nature, modularity, low immunogenicity, high-level 
expression in microorganisms such as bacteria, high thermal, 
chemical and protease stabilities, high solubility and aggrega- 
tion resistance, ability to access cryptic epitopes, and ease of 
genetic manipulation and display library construction.''^"'^' 
V^FIs are more convenient to obtain due to their better bio- 
physical properties and the existence and accessibility of in vivo 
naive and immune V^H repertoire sources, but human V^^s and 
Vj^s have the perceived advantage of being less immunogenic in 
human therapy. A number of reports have implied human V^^s 
may be superior therapeutic candidates compared with human 
Vj^s because of their lower tendency to aggregate, which 
may translate to lower immunogenicity and in turn higher 
therapeutic efficacy for V^^s. 

In vivo, human constructs are the result of genetic recom- 
bination between germline gene segments V^^ and J^^. The first 
two complementarity-determining regions (CDRl and CDR2) 
and a part of the CDR3 up to residue 95 are encoded by V^^ 
segment genes; the rest of the CDR3 and the entire framework 
region (FR) 4 are encoded by J^^ gene segments. Human V^^s 
are classified as either k or \ subtypes, with seven gene seg- 
ment subgroups (V^l-7) within the k class and 11 V^^ gene seg- 
ment subgroups (Vj^l-11) within the X. class (http://www.imgt. 
org/).^*"'^^^ In general, domains exhibit higher solubility and 
stability than Vj^ domains, possibly due to a higher packing den- 
sity in their upper core and a more hydrophilic C-terminus, and 
among the subgroups, V^3 subgroup members exhibit the 
best properties in terms of solubility and thermodynamic stabil- 
jj-y 33,38 ^ significant proportion of human V|^s, predominantly 
of class, bind to the B cell super-antigen protein L.^' '" 

domains are similar to V^s in terms of overall structure. 
They are composed of two (3-sheets that are formed by several 
anti-parallel |3-strands and pack face-to-face to form (3-sandwich 
structures.*^ Also, similar to V^^s, they possess a pair of cyste- 
ine residues at spatially equivalent positions (Kabat positions 23 
and 88)'" that form a highly conserved disulfide linkage. This 
linkage, which pins together the two j3-sheets in the core of 

domains, plays a critical role in maintaining the structural 
integrity of V^^s.''''''" Previously, it was shown that engineering 
an additional disulfide linkage in the core of a set of human V^jS 
improved their aggregation resistance and thermostability.**'*^ 
Given the overall structural similarity between V^^ and V^^s, it is 
hypothesized that the same engineering approach, with similar 
stability improvements, should be applicable to V^^s. 

Here, to further explore the merits of human V^^s as thera- 
peutic modalities, we set out to perform an extensive biophysi- 
cal characterization of a set of test V^^s. We constructed a naive 



human V^^ phage display library, and from it isolated a diversity 
of domains with protein L binding property by a phage selection 
method that was previously shown to be highly selective for non- 
aggregating human domains.*** We then characterized a rep- 
resentative sample of Vj^s for properties such as expression yield, 
non-aggregation, thermal stability, reversible thermal unfolding, 
structural integrity, and protease resistance. Next, we determined 
if a non-canonical disulfide linkage engineering approach previ- 
ously shown to improve the thermostability and protease resis- 
tance of camelid V^jHs and human Vj^s*"^'*^'*'"'^ would do the 
same for the present V^^s. Thus, we engineered human V^^s with 
an additional, non-canonical disulfide linkage between Cys48 
and Cys64 in |3-strands C and D. We then performed pair-wise 
biophysical comparisons between wild-type and their corre- 
sponding Cys mutant domains. 

Results 

Identification and sequence analysis of human Vj^s 
Essentially the same selection method employed to isolate 
non-aggregating V^^s from a human V^^ phage display library was 
applied to a human V^^ library for isolating soluble, monomeric 
Vj^s.*** A human V^^ library with a size of 3 x 10'' transformants 
was constructed. Twenty-four clones (plaques) from the library 
titer plates were isolated and their V^^ genes were amplified by 
PGR and then sequenced. The sequences were diverse in terms of 
germ-line origin, although 75% of the V^^s were of V^^ origin (data 
not shown). Three rounds of panning against protein L resulted 
in enrichment for large plaques. Thirty-four of the large plaques 
were sequenced and 32 unique sequences were identified (Fig. 1). 
Except for HVLP389, which is from the X class (subgroup V^^l, 
V germline lb), the remaining 31 Vj^s belonged to the class. 
Of the 31 K class V^^s, 24 fell within the V^3 subgroup and 7 fell 
within the V^l subgroup. Sixteen of the 24 V^3 sequences utilized 
the L6 V germline sequence, while the remaining sequences uti- 
lized A27, L2, and L16 V germline sequences. The V^l subgroup 
Vj^s originated from the 02/012 or A30 V germline sequence. 
Noticeable mutations occurred at position 96. The germline 
amino acids at this position are aromatic and hydrophobic amino 
acids Trp, Phe, Tyr, Leu or He for k class Vj^s, and Tyr, Val or Ala 
for X class V^^s. In the selected pool of k class V^^s, however, only 
5 out of 31 V^s had their germline amino acids at position 96: 
HVLP325, HVLP349, HVLP388, HVLP3109, and HVLP393; 
21 V^s had charged amino acids, of which 20 were positively 
charged, 2 had Pro, 1 had Gin, 1 had Ser, and one had Thr at 
positions 96. Moreover, 18 Vj^s of the k class had their last three 
germline residues (105-107) replaced with amino acids Thr, Val, 
and Leu, which are only found in X class V^^s. 

Expression yield and aggregation status of human Vj^s 
Eight of the selected V^^s representing different V germline 
origins were expressed in E. colt TGI in 1 L cultures and puri- 
fied: HVLP324, HVLP325, HVLP335, HVLP342, HVLP351, 
HVLP364, HVLP389, and HVLP3103 (Fig. 2A; Table 1). All 
were expressed in good yields ranging from 6.2 mg for HVLP325 
to '75 mg for HVLP335 and HVLP364. The aggregation 
tendency of the human V^^s was assessed by Superdex 75™ 
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Figure 1. Amino acid sequences of V^^s selected from a human V^^ phage display library by panning against protein L. The dots in the sequence 
entries indicate amino acid identity with HVLP333. Dashes are included for sequence alignment. See V BASE (http://vbase.mrc-cpe.cam.ac.uk/index. 
php?&iVll\/lN_position=1:1) for sequence numbering and CDR and FR designations. L6, A27, L2, L16, 02/012, ABO, and lb are V germline gene segment 
designations. J germline gene segment designations are in brackets. NF, not found. 



size-exclusion chromatography (SEC).''^ At a concentration of 
0.6 mg/mL (43 |xM) all V^^s were essentially free of aggregates and 
gave single, symmetrical peaks (Fig. 3A). HVLP351, HVLP342, 
HVLP335 and HVLP3103, were still monomers when tested at 
their highest concentration available, i.e., 0.89 mg/mL (64 |jlM), 
1.0 mg/mL (72 |jlM), 4.9 mg/mL (352 jjlM), and 5.9 mg/mL 
(430 ixM), respectively, although slight tailing was observed 
for the HVLP335 monomeric peak at 5.9 mg/mL, suggesting 
interaction with the column matrix. The apparent molecu- 
lar masses (M s) of V, s, calculated from their elution volumes 
(Fig. SI), ranged from 6.9 kDa to 24.3 kDa, with a mean M 
± SEM of 13.7 ± 2.2 kDa and a M, ± SEM of 13.8 ± 0.04 koT 

ror 

Variation in ^^p^s for non-aggregating V^^s with similar formula 
molecular masses (M^^s) has been reported previously.^^ Such 
variation was also observed in the case of highly non-aggregating 
V^s and may have been the result of weak transient interactions 
with the column materials or monomer/dimer equilibria.'' The 
non-aggregating status of V^^s was confirmed by a surface plas- 
mon resonance (SPR) assay based on the Ni^*-Hisg tag interac- 
tion that involved flowing His-tagged V^^s over Ni^* -immobilized 
sensorchip surfaces.*'' Similar to previous results obtained with 
monomeric V^jS, for the two dissociation phase windows tested, 
all the C-terminally HiSj,-tagged V^^s gave k^^^ that are very 



similar to those of the monomeric V^^H control, but drastically 
faster than those for the dimeric V^j control with two C-terminal 
HiSj, tags, confirming the monomeric status of the V^^s (Fig. 3B). 
This conclusion was accurately endorsed by the multiangle light 
scattering (MALS) experiments, which showed that the V^^s had 
^MALs* (molecular masses determined by MALS) that were very 
similar to their Afj-^^s (Table 1). 

Protein L binding of human Vj^s 

As anticipated, all selected Vj^s bound to protein L in SPR 
analysis (Fig. S2A; Table 1). The K^s of binding to protein L 
were in the 0.2-3 |xM range, with HVLP324 and HVLP342, 
which belong to V^l subgroup, showing additional smaller K^s 
of 0.07 |JlM and 0.04 |xM, respectively, when Biacore analy- 
ses were performed at a low concentration range (1—10 nM) 
(Table 1). The estimated stoichiometry of binding, V^^iprotein 
L, were 7 for V^l subgroup members, 3 for all V^3 sub- 
group members except for HVLP351 which was 2, and 1 for 
HVLP389, a V,l member. 

T* s of human V, s 

To assess the thermostability of the Vj^s, 7^s of V^^s were deter- 
mined based on ellipticity data assuming a two-state system, 
which is in agreement with the observed denaturation curves 
corresponding to a sharp transition into denaturation (Fig. 4A). 
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Figure 2. Structure of a set of human V^^s chosen for detailed biophysical analyses. (A) Amino acid sequences of eight human V^^s and their Cys mutant 
versions. The amino acid positions (48 and 64) where mutations to cysteine residues were made are marked. (B) Homology structures of wild-type (left) 
and Cys mutant (right) of HVLP324. The native and engineered non-canonical disulfide linkages are shown as light grey and dark grey spheres, respec- 
tively. Protein homology structures were obtained using the GenoBD automatic modeling tool (http://pbil.ibcp.fr/htm/index.php). The figures were 
drawn with PyMOL (www.pymol.org) and Disulfide by Design (version 1.20) freeware.^" 



T^s, taken at midpoint of the sigmoidal denaturation curves of 
molar ellipticity change (AG) vs. temperature (°C), were in the 
range of 51.9-68.5 °C (median T = 61.9 °C) (Table 1). The 
lowest 7^ was that of HVLP389, which is the only Vj^ of \ family 
among the eight V^^s, and the highest 7^ was that of HVLP325, 
which showed slight aggregation (Fig. 3A; Table 1). The 7" s of 
eight non-aggregating Vj^s (HVHP428, HVHP413, HVHP414, 
HVHP421, HVHP429, HVHP44, HVHP420, and HVHP419) 
isolated previously in the same manner as Vj^s''* were also deter- 
mined for comparison. The 7^^ range was 54.2—64.2 °C and the 
median T was 57.6 °C; the T differences between V, s and 

m m L 

V^s were not significant (Mann-Whitney test, two-tailed; P = 
0.2345) (Fig. 4Band D). 



Thermal refolding efficiency of human Vj^s 

The ability of the human Vj^s to refold following thermal 
denaturation was assessed by determining their thermal refold- 
ing efficiencies (TREs), which is calculated as the ratio of the 
K^s for the binding of the native V^^s to protein L (TT^n) to the 
K^s for the binding of the heat-denatured/cooled (refolded) V^^s 
to protein L (K^ief) K^s were determined by SPR (Fig. 5; 
Fig. S3; Table 1). Figures 5A and B compare the protein L bind- 
ing sensorgram profiles for HVLP335 and HVLP325, respec- 
tively, in native and refolded states at 20 jjlM concentrations. 
It can be seen that for HVLP335, with near perfect refolding 
(95%) (Table 1), the sensorgrams for the native and refolded spe- 
cies are almost superimposable. In contrast, for HVLP325 with 
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Table 1. Biophysical characteristics of V^^s 





group 


M 

for 

(kDa) 


(kDa)" 


yield (mg)'' 


(pM) 


V|^:protein 


T 
t 

(°C) 


AT 

(°C) 


TRE 


(%)" 


Gl protease resistance (%)' 


4 

pM 


20 
pM 


Trypsin 


Chymotrypsin 


Pepsin 


HVLP324 


V 1 


13.83 


13.84 
±0.76 


2.5,7 


0.2, 
0.07' 


7:1 


66.1 


J 3 


90 


78 


34 


100 


22 


HVLP324S 


13.87 


14.68 
±0.58 


0.5, 1.1 


0.1, 
0.06' 


7:1 


73.4 


ND9 


ND9 


32 


36 


46 


HVLP325 


V D 


13.61 


13.5 ± 
0.63 


0.5, 6.2 


1 


3:1 


68.5 


1 A 
1 ^ 


94 


65 


67 


76 


34 


HVLP325S 


13.66 


13.35 
±0.64 


2.2,3.1 


1 


3:1 


82.5 


ND3 


NDs 


81 


93 


100 


HVLP335 


V J 


13.90 


15.26 
±0.50 


73.5 


2 


3:1 


61.7 


1 / .J 


92 


95 


2 


59 


0 


HVLP335S 


13.94 


16.8± 
0.47 


5.5 


2 


3:1 


79.0 


ND9 


NDf 


0 


32 


62 


HVLP342 


V 1 


13.95 


14.22 
±0.64 


1,7.7 


0.6, 
0.04' 


7:1 


58.4 


5 4 


92 


70 


25 


73 


3 


HVLP342S 


13.99 


17.02 
±0.40 


1.7, 10.8 


0.2, 
0.05' 


7:1 


63.8 


ND9 


NDs 


5 


40 


5 


HVLP351 


V D 


13.85 


14.60 
±0.61 


1.2, 8.9 


2 


2:1 


62.0 


Q Q 


87 


65 


71 


100 


31 


HVLP351S 


13.89 


14.61 
±0.53 


1 .9, 4.8 


0.7 


2:1 


71.9 


NDs 


ND9 


100 


99 


94 


HVLP364 


V D 


13.95 


14.20 
±0.58 


0.3, 77 


3 


3:1 


57.0 


1 D.J 


100 


92 


32 


58 


0 


HVLP364S'' 


13.97 


14.98 
±0.43 


4.7 


NDf 


NDf 


72.3 


ND9 


NDf 


0 


12 


0 


HVLP3103 


V 3 


13.85 


14.29 
±0.64 


0.7, 19 


1 


3:1 


65.7 


1 0.7 


94 


92 


81 


90 


0 


HVLP3103S 


13.89 


14.05 
±0.68 


6.5 


1 


3:1 


76.4 


ND9 


ND9 


47 


80 


89 


HVLP389 


vj 


13.69 


15.12 
±0.50 


3,16.7 


1 


1:1 


51.9 


14.4 


91 


91 


15 


74 


0 


HVLP389S 


13.72 


13.65 
±0.68 


6.5 


1 


1:1 


66.3 


ND9 


NDf 


64 


90 


11 



chiometry of V^-protein L binding; ''Thermal refolding efficiency at 4 and 20 |jM V,^ concentrations, respectively; 'Percentage proteolytic resistance values 
are at protease concentrations of 10 pg/mL (see also Fig. 6E); 'Smaller values correspond to the binding of HVLP324, HVLP324S, HVLP342, and HVLP342S 
to the high affinity sites on protein L; 'ND, not determined; ''HVLP364S additionally gave a significant second peak (Fig. 3A) with a smaller elusion volume 



65% TRE (Table 1), a significant drop in binding is observed 
for refolded species compared with the native species when sen- 
sorgrams at the same concentrations are compared. At 4 |jlM V^^ 
concentrations, TREs are in the range of 90—100%, except for 
HVLP351, which is slightly lower at 87%, indicating a near per- 
fect refolding for V^s (median = 92%) (Fig. 5C). At 20 |xM 
concentrations, the TREs significantly decrease, with V^^s hav- 
ing a TRE range of 65—95% and a TRE median dropping to 
84.5% (Wilcoxon matched-pairs signed rank test, two-tailed; P = 
0.0469) (Fig. 5C). However, for four V^^s (HVLP335, HVLP364, 
HVLP389, HVLP3103), TRE values still remains above 90%, 
and for three of the four, TREs do not change when V^^ concen- 
trations are increased from 4 |xM to 20 |xM. For comparison, 



the TREs of the eight non-aggregating V^^s at 4 |jlM and 20 jjlM 
concentrations are also included (Fig. 5C; Fig. S4). Two of 
the eight y^s (HVHP428 and HVHP413) showed low TREs of 
48% and 7% at 4 |xM, with the remaining 6 V^s (HVHP414, 
HVHP421, HVHP429, HVHM44, HVHP420, HVHP419) 
having TREs of 90-98% (median = 92%). As with V^s, the 
TREs of Vj^s at 20 |jlM concentrations decreased significantly 
(median = 78.5%, Wilcoxon matched-pairs signed rank test, 
two-tailed; P = 0.0078), with HVHP428 and HVHP413 V^^s 
failing to refold (TRE = 2% and 1%, respectively). Statistical 
analysis of TREs revealed that V^s were comparable to V^s in 
terms of reversibility of thermal unfolding [Mann-Whitney test, 
two-tailed; P= 0.8785 (4 jjlM), /'= 0.5054 (20 |jlM)]. Moreover, 
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Figure 3. Size-exclusion chromatography and SPR analyses of wild-type and mutant V^^s. (A) Superdex 
75™ size-exclusion chromatograms of V^^s with monomeric peaks marked with arrowheads. For 
HVLP364S, the dimeric V,^ peak situated to the left of the monomeric peak is visible. (B) SPR analysis of V,^ 
binding to a Ni^*-NTA sensorchip. A llama V„H iVlonomer (A4.2''') and a Dimer**' were used as controls. 
Measurements were taken at two dissociation phase windows (31 5-375 sand 780-840 s) and the mean 
values obtained from two independent trials performed in duplicates were recorded in the table. SPR 
experiments were performed with SEC-purified V^^s. 



the drops in TRE values as a function of increased concentra- 
tions of Vj^s and V^^s are significant and expected as previously 
reported." No significant correlation between TRE and was 
found. 

Protease resistance of human Vj^s 

In addition to thermostability, e.g., T^, protease resistance is 
also a measure of protein stability. To determine protease resis- 
tance, Vj^s were treated with major gastrointestinal (GI) pro- 
teases, trypsin, chymotrypsin, and pepsin, at various protease 
concentrations (Fig. 6; Table 1). As expected, a gradual decrease 
in protease resistance of V^^s was observed as a function of pro- 
tease concentration for all three proteases (Fig. 6A, B, and C). 
At the highest trypsin concentration (20 |JLg/mL), the protease 
resistance of V^^s was in the range of 0-75% with a median resis- 
tance of 16% (Fig. 6A). The V^^s, however, demonstrate higher 
resistance to chymotrypsin (Fig. 6B). For example, the Vj^s dem- 
onstrated a median resistance of 75% at a chymotrypsin concen- 
tration of 10 fJLg/mL compared with 33% for trypsin at the same 
concentration; at a higher chymotrypsin concentration of 20 |JLg/ 
mL, Vj^s had a protease resistance range and median of 34-100% 
and 67.5%, respectively. Vj^s showed the least resistance to pep- 
sin (Fig. 6C). At 1 jjLg/mL pepsin concentration, two of the V^^s 
(HVLP364 and HVLP389) were digested almost completely 
with 8% and 2% pepsin resistance, respectively, and at 10 jjtg/ 
mL pepsin concentration, the resistance of all Vj^s decreased to 
below 34% (median resistance = 1.5%) (Table 1). At 20 jJtg/ 



I (%) mL pepsin concentration, the pepsin 
resistance range and median were 
0-23% and 0%, respectively. 

We compared V^^s to the eight 
non-aggregating V^s in terms of 
resistance to the three proteases at 
10 |xg/mL protease concentration 
(Fig. 6D). We found that there was 
no significant difference between 
Vj^s and V^^s with respect to resis- 
tance to trypsin (Mann-Whitney 
test, two-tailed; P = 0.1605). V^^s 
were significantly more resistant 
to chymotrypsin than V^^s (Mann- 
Whitney test, two-tailed; P = 
0.0148), while the opposite was true 
with respect to resistance to pepsin 
(Mann-Whitney test, two-tailed; P = 
0.0011). Theoretical numbers of pro- 
tease cleavage sites were determined 
by a protease digestion prediction 
webware (http://web.expasy.org/ 
peptide_cutter) . It was found that 
Vj^s had significantly higher number 
of protease cleavage sites than V^^s 
for all three proteases [trypsin sites 
medians: 8.5 and 10.5 for V^^ and 
V^, respectively, (Mann-Whitney 
test, two-tailed; P = 0.0207); chymo- 
trypsin sites medians: 10 and 13.5 for 
respectively, (Mann-Whitney test, two-tailed; P = 
0.0070); pepsin sites medians: 30 and 39.5 for V^^ and V^^, respec- 
tively [Mann-Whitney test, two-tailed; P= 0.0002)]. 
Disulfide linkage engineering of human Vj^s 
To improve the stability of human V^^s, we created 8 Vj^ 
mutants (HVLP324S, HVLP325S, HVLP335S, HVLP342S, 
HVLP351S, HVLP364S, HVLP389S, and HVLP3103S) with 
a pair of Cys substitutions at amino acid positions 48 and 64 
(Fig. 2A). All were expressed well in E. coli, albeit with lower 
yields compared with wild-type V^^s, with expression yields 
ranging from 1.1 mg/L of bacterial culture in shaker flasks for 
HVLP324S to -11 mg/L for HVLP342S (Table 1). 

To determine if the engineered Cys pairs formed the desired 
disulfide linkages in the mutants, mass spectrometry (MS) was 
performed by analyzing tryptic digests of mutant V^^s. The 
MS analyses revealed that the disulfide linkage was formed as 
intended in all V^^ mutants (Fig. S5; Table 2). The disulfide- 
linked peptide ions appeared prominent in the survey of LC-MS 
chromatograms with tryptic peptides of the mutant V^^s. The 
expected disulfide-linked peptide sequences corresponding to 
each mutant Vj^s were confirmed by manual de novo sequencing. 
When there was only one disulfide linkage between two peptides, 
the exact disulfide linkage position was confirmed by an almost 
complete disulfide-linked y fragment ion series from one peptide 
with the other peptide attached as a modification via a disulfide 
bond that remains intact under collision-induced dissociation 
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Figure 4. Thermostability analysis of antibody domains. Thermal unfolding curves of (A) wild-type V^^s, (B) V„s, and (C) mutant V^^s. T^s were calculated 
and incorporated into Figure 4D and Table I.The non-aggregating V^s are described in ref. 48. (D) Graph comparing the T^s of wild-type V^^s to those 
of V^s and mutant V^^s. 



(CID),''^''''«« e.g., the disulfide linked peptides ATLSCR 
(PI) and GSGTLFTLTISSLEPEDSAVYFCQQR (P2) of 
HVLP325S (Table 2; MS^ data not shown). When there were 
three peptides linked by two disulfide bonds, the y fi-agment 
ion containing the linkage close to the N-terminal of two pep- 
tides was difficult to observe. Nevertheless, an almost com- 
plete y ion series of each peptide was observed. For example, a 
prominent ion at m/z 1042.66 (6+) from HVLP324S tryp- 
tic disulfide-linked peptide LLCFAASTLQSGVPSR (PI), 
FSCSGSGTDFTLTISNLQPEDFATYYCQQSYSTPR (P2) 
and VTITCR (P3) was observed from the survey of LC-MS 
chromatogram (Fig. S5B (top panel); Table 2). Informative y 
fragment ions were observed from P2 with P3 as a modification 
via a disulfide bond, and an almost complete y ion series of PI 
was observed as well (Fig. S5B, top panel). To further confirm 
the above disulfide bond formation in the mutant HVLP324S, 
the electron transfer dissociation (ETD)-MS~ spectrum of the 
peptide ion [M + 5FI]'* at m/z 1250.99 (5+) from the same disul- 
fide-linked peptide of HVLP324S was acquired (Fig. S5B, mid- 
dle panel). The most abundant charge-reduced ETD fragment 
ion [M + at m/z 1563.49 (4-h) was selected for CID to 

obtain the ETD-CID-MS^ spectrum of the m/z 1250.99 (5+) ion 
(Fig. S5B, bottom panel). The intact PI, P2, and P3 ions at m/z 
691.47 (1+), 1648.86 (1+), and 1956.85 (2+), respectively, were all 



observed at relatively high abundances upon dissociation of the 
disulfide linkages of the three linked peptides by ETD. Tryptic 
peptides linked by the engineered disulfide bonds were positively 
identified for all mutant Vj^s. These fragments are recorded in 
Table 2. 

Aggregation status of disulfide linkage-engineered human 

Next, we aimed to determine if the engineered disulfide 
linkage compromised the non-aggregation status of V^^s. To 
this end, we assessed the mutant V^^s by Superdex 75™ SEC. 
Except for HVLP364S, which seemingly formed dimeric aggre- 
gates at -17%, the remaining seven Vj^ mutants were monomeric 
(Fig. 3A), indicating that similar to wild-type V^^s, mutant V^^s 
with the extra disulfide linkage were aggregation-resistant. 
Moreover, the slight aggregation observed in HVLP325 disap- 
peared in the mutant version, HVLP325S. The M s of mutant 

. . . "Pf. 

Vj^s, calculated from their elution volumes, were similar to those 

of wild-type versions, ranging from 4.9 kDa to 23.5 kDa with 
a mean M ± SEM of 11.4 ± 2.3 kDa compared with 13.7 ± 

app ^ r 

2.2 kDa for the wild-type V^^s. The SEC results were further 
confirmed by the MALS data, which showed that the V^^s were 
indeed monomeric as their experimental M^^^^^s were very close 
to their theoretical M^^s (Table 1). Furthermore, the minor 
HVLP364S peak identified as corresponding to a V^^ dimer in 
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Figure 5. Thermal refolding efficiency determination of human V^^s and 
V^s by SPR. (A), (B) Representative SPR sensorgrams for the binding of 
native and heat-denatured/cooled (refolded) human V^s to immobilized 
protein L. Data are from TRE experiments performed at 20 nMV,^ concen- 
trations, concentrations used to construct each sensorgram set was 
12.5, 18.8, 25, 37.5, 50, 75, and 100 nIVl for HVLP325 and 12.5, 20, 25, 30, 
38, 50, and 75 nM for HVLP335. See Figure S3 and S4 for SPR data 
for all V^s and V^s. The K^ns and K^refs determined from "Native" and 
"Refolded" sensorgram pairs were used to determine TREs in (C). (C)TREs 
of wild-type V^^s and V^s obtained under 4 \iM and 20 |jM domain con- 
centration conditions in refolding experiments. Lines connect TREs for 
the same clones. 



SEC had indeed the Af^^^s ^ dimeric V^: 26.21 ± 0.47 kDa 
(vs. the expected M^^^ of 27.94 kDa). 

Probing conformational changes in disulfide linkage-engi- 
neered human Vj^s by protein L binding 

To probe any possible structural changes brought about by the 
non-canonical disulfide linkage in mutants, the binding of 
Vj^ mutants to protein L was quantified by SPR. We found that 
all mutants, with the exception of HVLP324S, HVLP342S, 
and HVLP351S, bound to protein L with almost the same K^s 
(and ^^^s and k^^^) as their wild-type counterparts (Fig. S2; 
Table 1), indicating that, for the majority of V^^s, there were no 
structural changes due to the engineered disulfide linkage, or if 
there were any, they were too subtle to be sensed by protein L. 
HVLP324S, HVLP342S and HVLP351S showed 2- to 3-fold 
affinity increase toward protein L low affinity binding sites 
compared with wild-type counterparts. In contrast, in all cases 
the stoichiometry of V^^: protein L binding remained unchanged 



between the wild-type and corresponding mutant V^^s. In the 
case of HVLP364S, the seemed to be in the low micromolar 
range, but a reliable K^, and consequently a reliable stoichiom- 
etry, could not be determined by SPR due to aggregate contami- 
nation in the V^^ sample. A homology structure of HVLP324S 
suggests that the non-canonical disulfide linkage and protein L 
binding site occupy distinct locations in mutant V^^s (Fig. 7). In 
contrast, for a with a similar disulfide linkage mutation, the 
non-canonical disulfide linkage is very intimate with and imbed- 
ded within the protein A binding site (Fig. 7). 

Thermal stability of disulfide linkage-engineered human 

To determine the effect of the non-canonical disulfide link- 
age on the thermal stability of V^^s, the 7^s of mutant V^^s were 
determined and compared with those for wild-type V^^s. The T^s 
were in the range of 63.8-82.5 °C (median = 72.9 °C) com- 
pared with 51.9-68.5 °C (median T = 61.9 "°C) for wild-type 
V^s (Fig. 4C and D; Table 1). The thermostability improve- 
ments were significant, reflecting a 7^ increase (A7^) range 
and median of 5.4-17.3 °C and 12.4 °C, respectively (Fig. 4D; 
Table 1) (Wilcoxon matched-pairs signed rank test, two-tailed; 
P = 0.0078). Although the result indicated that the engineered 
disulfide linkage stabilized the V^^s regardless of their germline 
subtypes (k or X.), the thermostability improvements were more 
pronounced for the k3 and X.1 subgroup members compared 
with the k1 subgroup members. 

GI protease resistance of disulfide linkage-engineered 
human Vj^s 

Previously, Vj^Hs were shown to have acquired protease resis- 
tance with the addition of a similar non-canonical disulfide link- 
age.'" We therefore investigated the effect of the non-canonical 
disulfide linkage on the resistance of V^s to trypsin, chymotryp- 
sin, and pepsin. Mutant Vj^s were digested with varying concen- 
trations of proteases under the same digestion conditions as for 
wild-type V^^s (Fig. 6). We observed that there was no signifi- 
cant overall difference between wild-type and mutant V^^s with 
respect to resistance to trypsin or chymotrypsin (Fig. 6A, 6B, 
and 6E; Wilcoxon matched-pairs signed rank test, two-tailed; P 
= 0.2969, 0.8438, 0.8438, and 1.0625 for trypsin at I |JLg/mL, 
5 |JLg/mL, 10 |xg/mL, and 20 |JLg/mL, respectively; P = 0.0625, 
0.1563, 0.1484, and 0.0781 for chymotrypsin at 1 |JLg/mL, 
5 jJLg/mL, 10 |JLg/mL, and 20 jJLg/mL, respectively). 

Mutant Vj^s, however, showed improved resistance to pep- 
sin. The pepsin resistance medians of wild-type Vj^s were 73% 
and 1.5% at enzyme concentrations of 1 |JLg/mL and 10 |JLg/mL, 
respectively, and decreased to 0% at the concentration of 20 jJLg/ 
mL (Fig. 6C) . In contrast, mutant V^^s had pepsin resistance medi- 
ans of 100%, 54%, 39.5%, and 25.9% at enzyme concentrations 
of 1, 10, 20, and 50 |xg/mL, respectively. The pepsin resistance 
improvements were significant at 1 and 10 |xg/mL enzyme con- 
centrations (Fig. 6E; Wilcoxon matched-pairs signed rank test, 
two-tailed; P = 0.0078 and 0.0156, respectively), but not so at 20 
and 50 |JLg/mL enzyme concentrations (p = 0.0625 for both con- 
centration conditions) . Moreover, the correlation between prote- 
ase resistance and 7^ was explored (Fig. 6F). It was found that 
in general, Vj^s with higher 7^s had higher resistance to pepsin 
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(Pearson's correlation, 
0.6077; P = 0.0125, 
r' = 0.3693, at 1 |xg/mL 
enzyme concentration; 
Pearson's correlation, 
0.7807; P = 0.0004, 
r' = 0.6095, at 10 fig/ 
mL enzyme concentra- 
tion; Pearson's correla- 
tion, 0.7438; P= 0.0010, 
r' = 0.5532, at 20 |xg/mL 
enzyme concentration; 
Pearson's correlation, 
0.6837; P = 0.0035, 
= 0.4674, at 50 |JLg/ 
mL enzyme concentra- 
tion) . No significant 
correlation was observed 
in the case of trypsin or 
chymotrypsin (data not 
shown) . 

Discussion 

Numerous publica- 
tions in the past two 
decades have firmly 
established the suitabil- 
ity of sdAbs as thera- 
peutic and diagnostic 
agents. Human Y ^ and 
Vj^ sdAbs, in particu- 
lar, have been pursued 
as therapeutics due to 
their expected lower 
immunogenicity in 
patients compared with 
other classes of sdAbs 
such as camelid V^jHs 
and shark V^^^s."'^'^^ A 
number of studies have 
highlighted V^^s as bona 
fide affinity reagents, 
and a few, in particu- 
lar, have pointed to an 
inherent property of V^^s 
as being more aggre- 
gation resistant than 
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Figure 6. Gl protease resistance profiles for human V^^s and V^^s. (A), (B), (C) Trypsin, chymotrypsin, and pepsin resistance 
(res.) of human V|^s (open circles) and their corresponding Cys mutants (closed circles). (D) Graph comparing V^^s to V„s in 
terms of resistance to trypsin, chymotrypsin, and pepsin at 10 pg/mL protease concentrations. Horizontal lines in graphs 
(A)-(D) represent medians. (E) Graph showing trypsin, chymotrypsin, and pepsin resistance of human V^^s (open circles) 
and their corresponding Cys mutants (closed circles) in pair-wise manner at 10 pg/mL protease concentrations (see also 
Table 1). Lines connect protease resistances value for each to that for its corresponding mutant version. The p values 
in graphs (D) and (E) were obtained by the iVlann-Whitney test (two-tailed) and Wilcoxon matched-pairs signed rank test 
(two-tailed), respectively, using GraphPad Prism (GraphPad Software). (F) A correlation graph of pepsin resistance vs. 

(Pearson's correlation, 0.7807; p = 0.0004, r^ = 0.6095). Data are from digestion experiments performed at 10 ng/mL 
pepsin concentrations. Open circles, wild-type V^^s; closed circles, mutant V^^s. 



ThuS; 

from the aggregation 

point of view, human V^^s may be preferable over human V^jS as 
immunotherapeutics. In this study, we set out to obtain a deeper 
understanding of Vj^s with respect to a number of biophysical 
properties, including their aggregation tendencies. 

Human domains are known for their general tendency 
to aggregate. Previously, a phage selection method was used to 



obtain exclusively non-aggregating human V^^ domains from a 
naive human Vj^ phage display library that was propagated as 
plaques.**^ The library, with a size of 6 x 10** transformants, was 
panned against the B cell super-antigen protein A, and sequenc- 
ing of more than 110 clones with complete V^^ open reading 
frames yielded a total of 15 non-aggregating V^^s. By applying 
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Figure 7. Homology structures of HVLP324S and huVHAnn302S V^, 
comparing the positioning of protein L binding site to protein A bind- 
ing site relative to the engineered non-canonical disulfide linkages (blue 
spheres) for the V,^ and V^, respectively. huVHAm302S is a previously 
described human that lost its protein A binding activity by 3.5-fold 
upon the introduction of a non-canonical disulfide linkage at positions 
49 and 69.''^ Red spheres represent the native, canonical disulfide link- 
age, pink- and green-colored regions CDRs and FRs, respectively. Amino 
acid residues forming protein L binding sites 1 and 2 (top, left panel) 
and protein A binding site (top, right panel) are shown. The binding site 
residues were identified based on the published crystal structures of the 



essentially the same selection approach, we obtained a total of 
32 unique sequences out of 34 screened clones from a 200-fold 
smaller-sized V^^ library. Given that the selection method has 



been shown to select only for non-aggregating domains, this high 
yield isolation of Vj^s implies that human Vj^s are more frequently 
aggregation resistant than human V^^s, which is consistent with 
previous findings. This is further supported by the fact that 
our subset of randomly selected protein L binding V^^s represent- 
ing different germline origins were indeed non-aggregating as 
shown by size-exclusion chromatography, hexa-histidine capture 
SPR and multiangle light scattering experiments. This also con- 
firms the power of the aforementioned selection approach for the 
isolation of non-aggregating proteins. However, an intrinsic abil- 
ity of protein L to screen out structurally compromised, aggregat- 
ing domains in favor of non-aggregating protein L binders during 
panning experiments is a possibility that may have contributed to 
the strong selective power of the approach. 

It is not surprising that from a library consisting mostly of V,^ 
class Vj^ domains, the vast majority of the protein L binding V^^s 
isolated (31 out of 32 unique sequences) were of type (V^3 and 
V^l subgroups) with only one being of a V^^ type (HVLP389). 
Previous studies have shown that protein L predominantly 
binds to the type domains, specifically to those in the sub- 
groups V^l, V^3, and V^4, with a paucity of binding to X. class 
y^j_4o,4i,s2-66 Qiven the selective nature of our approach for stable 
(non-aggregating) domains, the predominance of V^3 subgroup 
type VjS (24/31) followed next by V^l subgroup type V^^s (7/31), 
and the absence of any V^4 subgroup type V^^s, in the pool of 
selected binders may be a reflection of the relative stability of Vj^s. 
Previously, it was shown that of the four Vj^ domains representing 
the consensus sequences of human subgroups V^l-4, the V^3 V^^ 
was the most thermodynamically stable followed by the V^l V^^, 
with the V^4 appearing to be the least stable of the three.'' 
Also, unlike the first two V, s, which were monomeric, the V 4 

' L ' ' K 

formed dimers, an indication of its aggregation tendency.'' 
In fact, a bleak selection for the V,^ class V^s in this study may 
not have been just the result of their general lack of binding to 
protein L, but also the result of their lower stability compared 
with V class V, s." Consistent with this is the fact that our lone 

K L 

X. class binder belongs to the V^^l subgroup, a subgroup whose one 
representative was shown to be more thermodynamically stable 
than two other V^^s representing subgroups V^l and V^3. It is 
also possible, however, that the relative proportion of V^3 and 
V 1 binders may simply reflect their relative proportions in the 
original, unselected Vj^ library. 

Mutations with respect to V^^ germline sequences were 
observed for the pool of selected V^s, but, in the absence of 
mutational studies, it is very difficult to assign solubility roles 
to mutation positions. Significant mutations at position 96, e.g., 
mutations from a hydrophobic germline amino acid to a posi- 
tively-charged amino acid in the vast majority of Vj^s, including 6 
of the 8 non- aggregating, representative V^^s, suggest a solubility 
role for position 96. Consistent with this, previous studies with 
immunoglobulin k1 light chains have suggested that while an 
aromatic or hydrophobic residue at position 96 enhances dimer- 
ization, a charged amino acid (Arg) at the same position results 
in stable light chain monomers. ""^ It was explained that Arg-Arg 
charge repulsion at positions 96 of monomers would interfere 
with dimer formation.''^ Further mutational studies are requireto 



complex between a human antibody Fab fragment (2A2, k1 subtype, 
family) and a single Peptostreptococcus magnus protein L domain^'^ (V ) 
and between Fab 2A2 and domain D of Staphylococcus aureus protein A 
(VJ. iVliddle and bottom panels show side and top views of the protein 
L and protein A binding sites in dots and sticks presentations. Protein 
homology structures were obtained as described in the Figure 2 leg- 
end. The figures were drawn with PylVlOL (http://www.pymol.org) and 
manipulated using Adobe Photoshop CS2 software. 
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Table 2. Disulfide linkage determination of V^^s by iVlS analyses 





Tryptic peptides'' 


(Da) 


M 

exp 

(Da) 


AW 
(Da) 


HVLP324S 


\/TIT/"D 1 1 fQI\ ACTI f^C/~\/DCD 
V 1 1 1 Ln...LLtrAAj 1 LtJ JO VroK... 

FSCSGSGTDFTLTISNLQPEDFATYYCQQSYSTPR 


6249.91'= 


6249.96" 


-0.05" 


HVLP325S 


ATLSCR...GSGTLFTLTISSLEPEDSAVYFCQQR 


3495.66 


3495.56 


0.10 


LLCFDTSNR...FSCR 


1576.71" 


1576.68" 


0.03" 


HVLP335S 


LLCYGTSNR...FSCSGSGTHFTLTI N R 


2750.29'' 


2750.32" 


-0.03" 


ATLSCR...LEPGDFAVYYCQQYGSSPR 


2826.27 


2826.20 


0.07 


HVLP342S 


VTITCR...LCYGASSLQGGVPSR...FSCSGSGTEFTLJI 
■iGl OPFDFATYYCi OHHTYPR 


61 36.85" 


61 36.80" 


0.05" 


HVLP351S 


ATLSCR...LLCYDASNR... 
rj^juju 1 Ur 1 LI IbjLtrt UrAVYYLtJvJn 


5049.26" 


5049.00" 


0.26" 


HVLP364S 


LLCYGASSR...FSCSGSGTDFTLTISR 


2644.23" 


2644.32" 


-0.09" 


Al rb5.K...LbrLLJrAVY Yl-UUYLJ 1 brK 


iUU4.zy 


3004.35 


-0.06 


HVLP389S 


LLCYGNDK...FSCSK 


1492.65" 


1492.59" 


0.06" 


VTISCSGSSYNIGENSVSWYQQLPGTAPK... 
SGTSAT LGITGLQTGDEADYYCGTWDSNLR 


6221.84 


6221.97 


-0.13 


HVLP3103S 


ATLSCR. . . LLCYGASTR. . . FSCSGSGTDFTLTISSLQV 
EDVAVYYCQQYYTTPK 


5528.56" 


5528.52" 


0.04" 



'Major tryptic peptides containing disulfide linkages are shown, with connecting cysteine residues single or double underlined (native or engineered Cys 
(C), respectively) and boldfaced (see Fig. S5 for experimental details). The triple dots between peptides denote sequence discontinuity, which was caused 
by the loss of V,^ sequences after trypsin digestion; the discontinuing peptides, however, are held together by disulfide linkage(s); "The very close match 
between M^^^ (formula molecular mass) and M^^^ (experimental molecular mass) indicates the presence of the Cys48-Cys64 disulfide linkage. 'A/M = M^^^ - 
M^^p. In addition, the disulfide linkages were confirmed by de novo sequencing the CID or ETD spectra of the disulfide linked peptides (e.g., see Fig. S5B). 



determine if the presence of a positively-charged amino acid at 
position 96 leads to V^^s that are more aggregation resistant, and 
if so, whether a negatively-charged amino acid would have the 
same effect. Other studies with immunoglobulin domains 
showed that substitution with negatively-charged amino acids, 
Asp substitution in particular, were more effective than posi- 
tively-charged substitutions in increasing the aggregation resis- 
tance of Vj^s.'"'* 

We also observed that at FR4 positions 105, 106, and 107, 
instead of the typical Asp/Glu, He, and Lys, the majority of the k 
class Vj^s had Thr, Val, and Leu, amino acids, respectively, which 
are characteristic of X. class Vj^s. Whether the substitutions have 
a role in improving the biophysical properties of the k class Vj^s 
remains to be seen. The importance of FR4 residues in improving 
the aggregation resistance of immunoglobulin variable domains 
(Vj^Hs and V^s) has been suggested.''''^" For example, V^^Hs, 
which are known for their high aggregation resistance, have a 
highly conserved 105Q mutation compared with the aggregation 
prone V^jS. Similarly, V^^s with the 105Q mutation have been 
shown to have improved aggregation resistance compared with a 
corresponding wild-type V^^. Also, the role of J segment, which 
codes for the FR4 amino acids, in increasing the thermal stability 
and non-aggregation of V^^Hs has been shown by others." 

SPR binding experiments on the eight representative non- 
aggregating Vj^s confirmed their protein L binding activity. Of 
interest were the two V^l type binders, which unlike the V^S 
and Vj^ type V^^s that bound to protein L with similar micromo- 
lar affinities (1-3 |JlM), bound to protein L with low (0.2 jjiM 



and 0.6 \sM) and high (0.04 jjlM and 0.07 |xM) affinities. High 
and low affinity binding of human Fab and light chain of V^l- 
subgroup type to 2 distinct sites on single Ig binding domains of 
protein L have been reported previously.''^''''' Here a differential 
Vj^:protein L stoichiometry was also observed, with the X type 
having a 1:1 binding stoichiometry, the V^3 Vj^s having a 
3:1 binding stoichiometry in the majority of cases, and V^l hav- 
ing a 7:1 binding stoichiometry. Previous SPR binding studies 
with 5 Ig-binding domains of a protein L showed that while all 
5 domains bound to a human light chain of V^l subgroup, 
only 3 bound to a human light chain of V^3 subgroup, sup- 
porting the higher observed stoichiometry for V 1 binders com- 
pared with V^3 binders and the 3:1 stoichiometry found for the 
V^3 binders. A binding stoichiometry of 7:1 in the case V^l sub- 
group binders is plausible, given that our protein L consisted of 
4 Ig-binding domains with each Ig-binding domain having up 
to 2 binding sites that can simultaneously engage with 2 sites 
on V^l subgroup binders. ''^•''''^' A higher affinity and stoichiom- 
etry (higher avidity) in the case of V^l type antibodies should 
translate to their more sensitive detection by protein L. The low 
affinity and lack of avidity as a result of a 1:1 stoichiometry, as 
shown here for the Vj^ type HVLP389, may be the reason for the 
reported weak interactions between human Ig X.-light chains and 
protein L.'"'^^'^^ Thus, failing to detect a Vj^-protein L interaction 
should not be interpreted as the lack of protein L binding activity 
on the part of a type antibody. It should, however, be men- 
tioned that the stoichiometry values are estimates. If the surface 
is not fully active the theoretical R_^^^ for 1:1 binding cannot be 
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attained, resulting in underestimation of tire binding stoichiome- 
try. Isothermal titration calorimetry may provide a more accurate 
means of determining the binding stoichiometrics. 

Consistent with being highly non-aggregating at concentra- 
tions as high as ==450 |xM, the V^^s showed high reversibility of 
thermal unfolding at relatively high V^^ concentrations. In this 
respect, and with respect to 7^^, the V^^s performed as well as 
our set of non-aggregating Vj^s. The observed lack of correla- 
tion between T'^ and aggregation resistance, expressed in terms 
of TREs, is consistent with previous findings that showed that 
aggregation resistant V^jS may not necessarily have high 7^ or 
thermodynamic stability.^^ Consistent with this finding is the 
fact that the V^^ with the highest T'^ from among the eight V^^s in 
this study (HVLP325) was also the only one that showed some 
degree of aggregation. 

The V^s were different from V^^s with respect to GI prote- 
ase resistance patterns. That is, while Vj^s were more resistant to 
chymotrypsin, the V^jS were more so to pepsin, with both V^^s 
and Vj^s being comparable in terms of resistance to trypsin. The 
observed protease resistance data for only chymotrypsin could be 
explained in terms of the number of potential cleavage sites, i.e., 
V^s had a significantly lower number of potential chymotrypsin 
cleavage sites than V^jS. Even when only the more protease acces- 
sible CDR sequences were considered in the calculation of theo- 
retical number of cleavage sites, the obtained numbers did not 
fully explain the observed differences between protease resistance 
profiles of V^^s and V^^s (data not shown). Previously for V^Hs, 
increases in pepsin resistance were correlated with increases in 
7^s, but this correlation cannot explain the better pepsin resis- 
tance of Vj^s here because both V^s and V^^s had very similar 
T^s.*^ As discussed previously, protease sensitivity is a function of 
a number of variables including the theoretical number of proteo- 
lytic sites, the location of proteolytic sites, and protein compact- 
ness and thermodynamic stability.^''^''^^ 

In an effort to further improve the biophysical properties 
of Vj^s, we introduced a pair of cysteine residues at amino acid 
positions 48 and 64, hypothesizing that this would lead to the 
formation of a disulfide linkage in the sdAb core. Previously, 
it was shown that the substitution for a pair of cysteine at spa- 
tially equivalent positions in V^^Hs and V^^s led to the forma- 
tion of disulfide linkages in all domains tested, with subsequent 
improvements in thermostability and protease resistance.'"'''"''" 
We too find here that all the Vj^s with the added Cys pair have 
the intended disulfide linkage, as well as improved thermostabil- 
ity (7^) and protease resistance. We find that the increases in T^^s 
(Arfare relatively high (5.4-17.3 °C, median Ar : 12.4 °C) 
compared with those obtained for a previously reported set of 
VpjHs with similar engineered disulfide linkages (A7^^: 4-12 °C, 
median AT^: 7 "C).*" This may, at least partly, be due to the 
fact that T^^s were obtained under different assay conditions and 
instrument settings. It cannot be excluded, however, that a non- 
canonical disulfide linkage may have been a better fit to the over- 
all fold of Vj^s, leading to their overall higher AZ'^ gains.'" This 
may also explain the differential T^^ gains observed among the 
mutant Vj^s that were characterized under identical conditions. 
We also find that the thermostability gain due to the engineered 



disulfide linkage is more pronounced for the k3 and \1 mem- 
bers, compared with the k1 members; however, general conclu- 
sions should await further experiments involving a statistically 
appropriate sample size. 

In terms of protease resistance, the non-canonical disulfide 
linkage led to increases in pepsin resistance of V^^s, without 
compromising their trypsin or chymotrypsin resistance. This is 
similar to the results obtained with a set of V^^Hs with similar 
non-canonical disulfide linkages.'" As with the V^jHs, positive 
correlations between pepsin resistance and were also observed. 
The higher pepsin resistance of the mutant Vj^s may be due to the 
fact that they may have a more compact and thermodynamically 
stable structure — equating to higher resistance to acid-induced 
unfolding under pepsin digestion conditions (pH = 2; pepsin 
is more effective on denatured proteins) — compared with the 
wild-type V^^s without the non-canonical disulfide linkage.^' 
Higher 7^s of mutants, suggesting their higher thermodynamic 
stabilities, and the fact that V^Hs with similar engineered disul- 
fide linkage became resistant to unfolding at pH 2 and 37 °C 
support this speculation.'" Importantly, the introduction of the 
non-canonical disulfide linkage into V^^s does not appear to 
compromise their aggregation resistance. This was shown to be 
the case for V^Hs and V^^s with similar non-canonical disulfide 
linkages as well.*''''*'-' In the case of V^^s, this led to improvements 
in the aggregation resistance of mutants compared with the wild- 
type counterparts without the non-canonical disulfide linkage. 

The biophysical improvements gained through the intro- 
duction of the non-canonical disulfide linkage do come at the 
expense of expression yield as mutants demonstrated significantly 
lower expression yields than wild-type V^^s in E. colt. This was 
also reported in the case of V^^Hs, which like the Vj^s here, were 
expressed in E. colt.'''' Thus, the relatively lower expression yields 
of Cys mutant domains compared with wild-type ones may have 
to do with the limited capacity of E. coli in folding proteins with 
higher disulfide linkages such as the Cys mutant V^^s in this 
study. This should be resolved by expressing the mutant Vj^s in 
eukaryotic microorganisms, e.g., yeast or mammalian cells, with 
the capacity to fold complex proteins such as those with multiple 
disulfide linkages. 

The biophysical improvements also come at the expense of 
undesirable conformational changes for mutants, which were also 
reported in the case of V^^Hs and V^^s.'"^''" The observed differ- 
ential protease resistance profiles between wild-type and corre- 
sponding mutant V^^s, as well as changes to protein L binding for 
some of the mutants, support this conclusion. However, for the 
majority of V^^s, the conformational changes, as determined by 
binding measurements of wild-type and Cys mutant Vj^s against 
protein L, are too subtle to be sensed by protein L, which binds 
to VjS in a conformation-dependent manner.'" This is in sharp 
contrast to the results obtained with our Cys mutant V^^s,'*'' where 
structural changes as a result of the introduction of non-canoni- 
cal disulfide linkages were more easily probed with protein A and 
led to up to 10-fold reductions in protein A binding of mutant 
Vj^s. Such discrepancy could be due to the fact that, while for V^^s 
the protein L binding site is too far from the engineered disulfide 
linkage to be affected by it, for V^s the protein A binding site 
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is in the influencing range of the non-canonical disulfide link- 
age. This is clearly supported by our homology structure data 
of a and a with similar non-canonical disulfide linkages 
{see Figure 7). The conformational changes in V^^ mutants were 
not reflected in SPR stoichiometry data either, as all wild-type/ 
mutant pairs had the same stoichiometry of binding to protein L. 

In conclusion, we demonstrated the suitability of Vj^ sdAbs 
as affinity reagents, in particular, as immunotherapeutics, and 
provided insights into the biophysical characteristics of Vj^s. We 
identified a diversity of non-aggregating V^^ domains that could 
form the basis of therapeutics when incorporated as library scaf- 
folds (e.g., into sdAb phage display libraries) as has already been 
demonstrated.'^ As, irrespective of the degree of the stability of 
the original library scaffold, loop randomization always leads 
to a proportion of the library consisting of unstable domains, 
a coupling of affinity selection to stability selection during the 
panning experiments to filter out unstable binders from the pool 
of binders is advisable. ^''•'^'^^'^^ Moreover, we presented a general 
strategy based on disulfide linkage engineering for stabilizing Vj^s 
in terms of thermostability and pepsin resistance. The disulfide 
linkage engineered V^^s would be the preferred scaffolds for con- 
structing phage display libraries over versions without. Such 
libraries, especially if affinity selected under high temperature 
conditions, should yield non-aggregating immunotherapeutics 
that are also thermodynamically stable, like those domains 
that were selected by panning a phage display library under 
the acidic conditions.^^ Due to the positive correlation between 
pepsin resistance and T^, affinity selection under heat should 
yield binders which are also pepsin resistant, hence suitable as 
oral therapeutics for GI tract applications. The disulfide link- 
age engineering approach should thus be viewed as an efficacy 
engineering one for therapeutic V^^s. The fact that the affinity 
reagents obtained from the aforementioned Vj^ libraries would 
have protein L binding properties offers biotechnological advan- 
tages similar to those offered by V^^s that bind to B cell super- 
antigen protein A.*"* 

Beyond single domains, the disulfide linkage engineering 
approach can be applied to domains in the context of scFvs, Fabs, 
and IgGs. Previously, when a similar disulfide linkage engi- 
neering approach was applied to a in the context of a Fab, 
it resulted in a significant increase in the thermostability of the 
Fab.^'' However, the introduction of the disulfide linkage results 
in conformational changes that may further lead to undesirable 
affinity and specificity compromises as has been shown in the 
case of Vp^Hs with similar disulfide linkage engineering.^''" This 
should not be a concern with domains that are selected from 
libraries already containing the non-canonical disulfide linkage. 

Materials and Methods 

Human Vj^ library construction and panning 

A phage display library in a multivalent display format and 
with plaque formation as the selectable marker was constructed 
in a similar fashion as a previously described V^^ phage display 
library.''* It was anticipated that, as in the case of the phage 
display library, phages displaying non-aggregating domains 



(Vj^s in the present study) would form larger plaques on bacte- 
rial lawns than phages displaying aggregation prone domains, 
and, as a result, panning the library against super-antigen pro- 
tein L for several cycles should eventually enrich the library for 
phage-displayed, aggregation-resistant V^^s that bind protein L. 
Such Vj^s can subsequently be retrieved and expressed as solu- 
ble, autonomous domains. Briefly, cDNA was synthesized from 
human spleen mRNA as previously described.*** The cDNA was 
used as a template for PGR to amplify V^^ genes in 50 ^jlL reac- 
tion volumes using six and 1 1 V^^ back primers^^ and four 
and two forward primers.^* The back and forward primers 
were modified to have flanking ApaLI and NotI restriction sites, 
respectively, for subsequent cloning purposes. Forward prim- 
ers were pooled together in ratios that reflected their degree of 
degeneracy. V^^ genes were subjected to PGR in 11 separate reac- 
tions using the pooled V^^ forward primers with 1 1 individual V,^ 
back primers. Similarly, genes were amplified in six separate 
reactions using the pooled forward primers with six individ- 
ual back primers. The PGR products were gel-purified and 
digested with ApaLI and NotI restriction endonucleases, and the 
library was constructed as previously described.'"* PGR was per- 
formed on individual library colonies, and the amplified Vj^ genes 
were sequenced as described before.^' Panning (against protein L; 
Pierce) and the germline assignment of Vj^s were performed as 
previously described.''* 

Cloning, expression, purification, SEC, and MALS analysis 
Introduction of cysteine residues at Kabat positions 48 and 
64'''' of VjS was performed using a splice overlap extension 
method.'"' V^^ expression, purification, concentration determi- 
nation, and SEG were performed as described for V^s.''^'" Size- 
exclusion chromatograms were normalized as described.'"^"''^ The 
elution volume values obtained from chromatograms were used 
to calculate M s of V, s usine the Loe M vs. V standard curve 

app Loo e 

described in Figure SI. UPLG-SEG-MALS was performed with 
a Waters BioAcquity system equipped with a Waters PDA detec- 
tor (Waters). The column temperature was maintained at 30 °G 
with a GM-A column compartment. The samples (10-20 |xg) 
were injected onto a Waters BEH125 SEG column (4.6 mm x 
150 mm, 1.7 |xm particles) at 0.4 mL/min with PBS (-Ga -Mg, 
Hyclone Thermo Scientific) as the solvent. MALS data was 
measured on a Wyatt HELEOS 8 + detector (Wyatt Technology 
Corporation) and weight average molecular masses (■''^^^^ls^) 
were calculated using a protein concentration determined using 
the A^gjj from the PDA detector with extinction coefficients cal- 
culated from the amino acid sequences. Experiments were per- 
formed in duplicates. MALS data processing was performed with 
ASTRA 6.1 software (Wyatt Technology Corporation). 
Disulfide linkage determination 

Disulfide linkage determination for Cys mutant V^^s was per- 
formed as described elsewhere.'"'*' Briefly, tryptic fragments for 
subsequent MS analysis were prepared as described.''^ Purified 
V^s at -15 pmol/|JLL in 50% (v/v) acetonitrile + 0.1% (v/v) for- 
mic acid were infused at 1 |jLL/min for electrospray ionization 
(ESI) mass spectrometric mass measurements of the V^^s using 
a Q-TOF 2™ mass spectrometer (Waters). The mass spectra 
of the Vj^s were de-convoluted using the MaxEnt 1 program in 
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the MassLynx software package (Waters). Aliquots of Vj^ pro- 
teolytic digests were re-suspended in 0.1% (v/v) formic acid (aq) 
and subsequently analyzed by nano-flow reversed-phase HPLC 
MS (nanoRPLC-ESI-MS) with data-dependent analysis (DDA) 
using CID on a nanoAcquity UPLC system coupled to a Q-TOF 
Ultima™ hybrid quadrupole/TOF mass spectrometer (Waters). 
The peptides were first loaded onto a 180 |JLm I.D. x 20 mm 5 
|jLm symmetry®C18 trap (Waters) and then eluted into a 100 |JLm 
I.D. X 10 cm 1.7 |JLm BEH130C18 column (Waters) using a lin- 
ear gradient from 0% to 36% solvent B (acetonitrile + 0.1% for- 
mic acid) over 36 min followed by 36-90% solvent B for 2 min. 
Solvent A was 0.1% formic acid in water. The peptide MS^ spec- 
tra were compared with mutant V^^ protein sequences using the 
Mascot™ database searching algorithm (Matrix Science). The 
MS^ spectra of the disulfide-linked peptides were de-convoluted 
using the MaxEnt 3 program (Waters) for de novo sequencing 
to confirm and determine the exact disulfide linkage positions. 
NanoRPLC-ESI-MS analyses with DDA using a combination of 
ETD and CID were performed on an LTQXL mass spectrom- 
eter fitted with an ETD source (Thermo Fisher Scientific) and 
coupled to a nanoAcquity UPLC system (Waters) using the LC 
conditions described above. Briefly, for the DDA experiments, 
the most abundant peptide ion from the survey scan was used 
for the automatic acquisition of ETD-MS~ spectra using 30% 
normalized collision energy followed by the acquisition of ETD- 
CID-MS' spectra of the most abundant charge-reduced ions pro- 
duced by ETD. 

SPR binding studies 

Vj^s were subjected to Superdex 75™ (GE Healthcare) SEC 
in HBS-EP buffer (10 mM HEPES, pH 7.4, 150 mM NaCl, 3 
mM EDTA and 0.005% P20 surfactant) at 0.5 mL/min prior 
to BIACORE analysis and purified monomer peaks were col- 
lected even in the absence of any evidence of aggregated mate- 
rial. Binding kinetics for the interaction of V^^s to protein L were 
determined by SPR using a BIACORE 3000 biosensor system 
(GE Healthcare). For wild-type V^^s, -600 resonance units (RUs) 
of protein L or 800 RUs of a Fab reference were immobilized onto 
research grade CM5 sensorchips (GE Healthcare). For mutant 
V^s, -650 RUs of protein L (for HVLP324S, HVLP325S, 
and HVLP342S) or 400 RUs of protein L (for HVLP335S, 
HVLP351S, HVLP389S, and HVLP3103S) or 400 RUs of 
ovalbumin (Sigma-AIdrich) as a reference protein were immo- 
bilized. For HVLP324S and HVLP342S, -1,700 RUs of protein 
L or 2,600 RUs of a ovalbumin reference were immobilized onto 
research grade CM5 sensorchips to determine binding data for 
the low affinity binding sites on protein L. Immobilization was 
performed at protein concentrations of 20 or 50 |JLg/mL in 10 
mM acetate buffer, pH 4.5, using an amine coupling kit sup- 
plied by the manufacturer (GE Healthcare). All measurements 
were performed at 25 °C in HBS-EP buffer at 40 or 50 fjiL/min. 
The surfaces were regenerated through washing with the running 
buffer. Data were evaluated using BIAevaluation 4.1 software 
(GE Healthcare). The stoichiometry of binding (Vj^:protein L) 
was estimated by comparing the observed R^^^s and theoretical 
R s for 1 : 1 binding. 

max o 



SPR analysis for the binding of V^^s, a control llama V^jH 
monomer (A4.2), and a control homodimer to a Ni^*-NTA 
sensorchip was performed as described previously.^'' All J 
Vj^H domains had His^, tags at their C-termini and the protein 
preparations lacked contaminating species without the HiSj, tag 
as determined by Western blot and SDS-PAGE analyses. The 
homodimer is formed by the non-covalent association of two 
monomer units, and, as a result, has two C-terminal HiSj, tags. 
SPR experiments were performed with protein fractions corre- 
sponding to the dimeric peak in the case of the V^^ dimer con- 
trol and monomeric peaks for the remaining V|^/Vj^H domains. 
RUs from duplicate data sets were averaged and then normal- 
ized to obtain %RU. The SPR binding of antibody domains to 
an activated NTA sensorchip was determined at 25 °C using a 
BIACORE 3000. In each cycle, NiCl, was injected, followed by 
an injection of V^^/V^/Vj^H. Sensorgrams were run in duplicates. 
The NTA chip was regenerated with EDTA before the next cycle. 
Dissociation rate constants {k^^^) were obtained over 60 s periods 
between 315-375 s and 780-840 s and data were analyzed with 
BIAevaluation 4.1 software. 

Protease digestion experiments 

Digestion experiments were performed in a total volume of 
30 |xL with 6 |xg of Vj^ in phosphate-buffered saline (137 mM 
NaCl, 2.7 mM KCl, 4.3 mM Na^HPO,, 1.47 mM KH^PO,, pH 
7.4) at 37 °C for 1 h with an enzyme to V^^ ratio of 1:200, 1:40, 
1:20, and 1:10 (trypsin/chymotrypsin), corresponding to 1 |JLg/ 
mL, 5 |xg/mL, 10 |xg/mL, and 20 jjtg/mL protease concentra- 
tions, respectively, or 1:200, 1:20, 1:10, and 1:4 (pepsin), which 
corresponds to 1 |JLg/mL, 10 |JLg/mL, 20 |xg/mL, and 50 |jLg/mL 
protease concentrations, respectively. Tryptic and chymotryptic 
digestion experiments were performed using sequencing grade 
enzymes (Roche Diagnostics) according to the manufacturer's 
instructions. Peptic digestion experiments were performed at pH 
2.0 using pepsin from Sigma-AIdrich. In control experiments, 
enzymes were replaced with equal volumes of reaction buffers. 
Reactions were stopped by adding an equal volume of SDS- 
PAGE sample buffer containing 0.2 M dithiothreitol and boiling 
mixtures at 95 °C for 5 min. The samples were then subjected 
to SDS-PAGE analysis and the percentages of intact sdAbs after 
protease digestions were determined by spot density analysis as 
described.'" For V^jS, digestion experiments were performed as 
described for Vj^s with an enzyme to ratio of 1 :20. 

TRE experiments 

TREs were determined by SPR as previously described.''* 
Briefly, Vj^s or V^jS were incubated at 85 °C for 20 min at concen- 
trations of 4 or 20 |xM, slowly cooled to room temperature for 
30 min, centrifuged at 16,000 X ^ for 5 min at 22 °C, and the 
supernatants, termed "refolded," were kept for subsequent SPR 
analysis. Following this, binding analysis against protein L and 
protein A for Y^s and V^jS, respectively, was performed. Native 
and refolded domains were analyzed under identical conditions. 
For V^s at 4 |jlM, 600 RUs of protein L and 700 RUs of a Fab 
reference were immobilized at 25 or 50 |xg/mL in 10 mM acetate 
buffer, pH 4.5. For V^^s at 4 |xM, 600 RUs of protein A (Sigma- 
AIdrich) or ovalbumin reference were immobilized at 50 |jLg/ 
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mL in 10 mM acetate buffer, pH 4.5. For Vj^s at 20 |jlM, 2,100 
RUs of protein L and 1,200 RUs of ovalbumin reference were 
immobilized at 100 |jLg/mL in acetate buffer, pH 4.0, and 50 
|Jug/mL in 10 mM acetate buffer, pH 4.5, respectively. For V^s 
at 20 jjlM, 650-1,100 RUs of protein A and 600-1,200 RUs of 
ovalbumin were immobilized at 50 |JLg/mL in 10 mM acetate 
buffer, pH 4.5. In all instances, the analyses were performed at 25 
°C in HBS-EP buffer at 40 |xL/min. The surfaces were washed 
thoroughly with the running buffer for regeneration. Data were 
analyzed with BIAevaluation 4.1 software and K^s were analyzed 
with 1 : 1 binding models. Refolding efficiencies were expressed in 
terms of the ratio of (refolded) /K^ (native).'"* 
T measurements 

m 

Circular dichroism spectra and 7^s of V^^s and V^jS were 
obtained as described previously''^''''' using a Jasco J-815 spec- 
tropolarimeter (Jasco) equipped with a Peltier thermoelectric 
type temperature control system. The circular dichroism spectra 
were recorded over a temperature range of 30 °C to 96 °C in 
100 mM sodium phosphate buffer, pH 7.4, and with a protein 
concentration of 50 |xg/mL. EUipticity changes at 203 nm for 



HVLP325 and HVLP389, at 218 nm for HVLP324, HVLP335, 
HVLP342, HVLP351, HVLP364, and HVLP3103, at 210 nm 
for Cys mutant V^s, at 205 nm for HVHP421, HVHP428, 
and HVHP429, at 200 nm for HVHP413, HVHP419, and 
HVHP420, and at 202 nm for HVHP44 and HVHP414 were 
used for constructing thermal unfolding curves and subsequent 
calculations of T s. At these wavelengths, wide differences in 

m D 

ellipticity values between the folded (at 30 °C) and fully dena- 
tured (at 90 °C) domains allowed reliable determination of 7" s 
(Fig. S6). 
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